Media content identification and playback

ABSTRACT

Systems, devices, apparatuses, components, methods, and techniques for identifying and playing media content are provided. An example media-playback device for identifying and playing media content for a user traveling in a vehicle includes an audio identification engine and a media playback engine. Audio content is recorded and identified by comparison to media content databases. The audio content is identified and immediately played on the same device. Additional media content is selected for playback based on user listening preferences.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a Continuation of U.S. application Ser. No.15/721,049, filed on Sep. 29, 2018, titled MEDIA CONTENT IDENTIFICATIONAND PLAYBACK, which claims priority to U.S. Application No. 62/441,267,filed on Dec. 31, 2016, titled MEDIA CONTENT IDENTIFICATION ANDPLAYBACK, the disclosures of which are hereby incorporated by referencein their entireties. To the extent appropriate a claim of priority ismade to each of the above-disclosed applications.

BACKGROUND

Many people enjoy consuming media content while travelling or duringother activities. The media content can include audio content, videocontent, or other types of media content. Examples of audio contentincludes songs, albums, podcasts, audiobooks, etc. Examples of videocontent includes movies, music videos, television episodes, etc. Using amobile phone or other media-playback device, such as avehicle-integrated media playback device, a person can access largecatalogs of media content. For example, a user can access an almostlimitless catalog of media content through various free andsubscription-based streaming services. Additionally, a user can store alarge catalog of media content on his or her mobile device.

This nearly limitless access to media content introduces new challengesfor users. For example, it may be difficult to find or select desiredmedia content when the user does not know any identifying informationfor the media content.

SUMMARY

In general terms, this disclosure is directed to media identificationand playback. In one possible configuration and by non-limiting example,a media-playback device generates a user interface that is specificallyconfigured to identify audio content and automatically play back a mediaitem corresponding to the audio content along with a related mediaplaylist. Various aspects are described in this disclosure, whichinclude, but are not limited to, the following aspects.

In one aspect, a media playback device comprises: a sound-sensing deviceconfigured to record audio content played by an external audio source; acontent output device; a memory device encoding instructions that, whenexecuted by a processor, cause the media playback device to record aportion of the audio content from the external audio source, identifythe audio content, receive a media item corresponding to the audiocontent, and automatically initiate playback of the media item on thecontent output device before playback of the audio content by theexternal source has concluded.

In another aspect, a method of identifying audio content comprises:recording audio content from an external audio source; analyzing theaudio content to identify an acoustic fingerprint; comparing theacoustic fingerprint of the audio content with a database of acousticfingerprints; identifying a media item matching the audio contentacoustic fingerprint; and automatically playing the media item.

In yet another aspect, a system for identifying and playing audiocontent comprises: an audio analysis server application configured toanalyze audio content received from a mobile device, identify anacoustic fingerprint of the audio content, compare the acousticfingerprint to an acoustic fingerprint database, and identify a mediaitem having the same acoustic fingerprint as the audio content; and amedia server application configured to access the media item from amedia content database, determine additional media content to include ina related media playlist, and communicate the related media playlist tothe mobile device.

In another aspect, one or more computer-readable media havecomputer-executable instructions embodied thereon that, when executed byone or more computing devices, cause the computing devices to: recordaudio content from an external audio source; analyze the audio contentto identify an acoustic fingerprint; compare the acoustic fingerprint ofthe audio content with a database of acoustic fingerprints; identify amedia item matching the audio content acoustic fingerprint; andautomatically play the media item.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates an example media-playback system for mediaidentification and playback.

FIG. 2 is a schematic illustration of the example media-playback systemof FIG. 1.

FIG. 3A is a schematic diagram of an embodiment of the media-playbacksystem of FIG. 1.

FIG. 3B is a schematic diagram of another embodiment of themedia-playback system of FIG. 1.

FIG. 3C is a schematic diagram of another embodiment of themedia-playback system of FIG. 1.

FIG. 3D is a schematic diagram of another embodiment of themedia-playback system of FIG. 1.

FIG. 3E is a schematic diagram of another embodiment of themedia-playback system of FIG. 1.

FIG. 3F is a schematic diagram of another embodiment of themedia-playback system of FIG. 1.

FIG. 4 is a schematic illustration of an embodiment of the vehiclemedia-playback system of FIG. 3A.

FIG. 5A is a schematic diagram of an embodiment of a vehiclemedia-playback system that incorporates a media-playback device.

FIG. 5B is another schematic diagram of an embodiment of the vehiclemedia-playback system of FIG. 5B.

FIG. 6 is a schematic block diagram of an embodiment of the audioidentification engine of FIG. 2.

FIG. 7 illustrates an example method of identifying and playing mediacontent that is performed by some embodiments of the media-playbacksystem of FIG. 1.

FIG. 8A illustrates an example method of recording audio content that isperformed by some embodiments of the audio identification engine of FIG.2.

FIG. 8B illustrates an alternative example method of recording audiocontent that is performed by some embodiments of the audioidentification engine of FIG. 2.

FIG. 9 illustrates an embodiment of the media-playback device of FIG. 1operating in a vehicle.

FIG. 10 illustrates an example method of analyzing and identifying audiocontent that is performed by some embodiments of the audioidentification engine of FIG. 2.

FIG. 11 illustrates an example method of playing media content that isperformed by some embodiments of the audio identification engine of FIG.2.

FIG. 12 illustrates an embodiment of the media-playback device of FIG. 1displaying a user interface.

FIG. 13 illustrates another embodiment of a user interface displayed onthe media-playback device of FIG. 1.

DETAILED DESCRIPTION

Various embodiments will be described in detail with reference to thedrawings, wherein like reference numerals represent like parts andassemblies throughout the several views. Reference to variousembodiments does not limit the scope of the claims attached hereto.Additionally, any examples set forth in this specification are notintended to be limiting and merely set forth some of the many possibleembodiments for the appended claims.

Users of media-playback devices often consume media content duringvarious activities, including travelling. As used herein, the term“travel” and variants thereof refers to any activity in which a user isin transit between two locations. For example, a user is in transit whenbeing conveyed by a vehicle, including motorized and non-motorized,public transit and private vehicles. A user is also in transit whenmoving between locations by other means such as walking and running.

Enjoying media content while travelling presents many challenges. First,it can be difficult to safely interact with a media-playback devicewhile traveling in a manner that does not interfere with travel relatedactivities (driving, navigating, etc.) Second, desired media content maynot be available or accessible in a format that can be accessed whiletravelling. Third, accessing media content while travelling may bedifficult, expensive, or impossible depending on networkavailability/capacity along the route of travel. Fourth, accessing andplaying back media content can require significant amounts of energy,potentially draining a battery in the media-playback device. Fifth, itcan be challenging to connect a media-playback device to avehicle-embedded audio system for playback while travelling in avehicle. Embodiments disclosed herein address some or all of thesechallenges. It should be understood, however, that aspects describedherein are not limited to use during travel.

For example, in some embodiments, the media-playback device includes alimited-attention interface that requires less attention from the userand/or is less distracting than a standard interface. Thislimited-attention interface can be useful during travelling because auser may have limited attention available for interacting with amedia-playback device due to the need to concentrate on travel relatedactivities, including for example driving and navigating. But thelimited-attention interface can also be configured for use playing backmedia content during other activities that require the user'sconcentration, such as exercising, playing games, operating heavyequipment, reading, studying, etc.

Consuming media content may include one or more of listening to audiocontent, watching video content, or consuming other types of mediacontent. For ease of explanation, the embodiments described in thisapplication are presented using specific examples. For example, audiocontent (and in particular music) is described as an example of one formof media consumption. As another example, travelling (and in particulardriving) is described as one example of an activity during which mediacontent is consumed. However, it should be understood that the sameconcepts are similarly applicable to other forms of media consumptionand to other activities, and at least some embodiments include otherforms of media consumption and/or are configured for use during otheractivities.

FIG. 1 illustrates an example system 100 for media identification andplayback. The example system 100 includes a media-playback device 102and a media-delivery system 104. The media-playback device 102 includesan audio identification engine 108 and a user interface 112. The system100 communicates across a network 106. Also shown, is a user U travelingin a vehicle V who is hearing an external audio source A.

The media-playback device 102 plays back media content items to producemedia output 110. In some embodiments, the media content items areprovided by the media-delivery system 104 and transmitted to themedia-playback device 102 using the network 106. A media content item isan item of media content, including audio, video, or other types ofmedia content, which may be stored in any format suitable for storingmedia content. Non-limiting examples of media content items includesongs, albums, audiobooks, music videos, movies, television episodes,podcasts, other types of audio or video content, and portions orcombinations thereof.

The media-playback device 102 plays media content for the user duringtravel or other activities. The media content that is played back may beselected based on user input or may be selected without user input. Themedia content may be selected for playback without user input by eitherthe media-playback device 102 or the media-delivery system 104. Forexample, media content can be selected for playback without user inputbased on stored user profile information, location, travel conditions,current events, and other criteria. User profile information includesbut is not limited to user preferences and historical information aboutthe user's consumption of media content. User profile information canalso include libraries and/or playlists of media content itemsassociated with the user. User profile information can also includeinformation about the user's relationships with other users (e.g.,associations between users that are stored by the media-delivery system104 or on a separate social media site). Although the media-playbackdevice 102 is shown as a separate device in FIG. 1, the media-playbackdevice 102 can also be integrated with the vehicle (e.g., as part of adash-mounted vehicle infotainment system).

A user interface 112 is generated for selecting and playing back mediacontent items. In at least some embodiments, the user interface 112allows a user U to prompt the audio identification engine 108 toidentify audio content from an external audio source A. In someembodiments, the external audio source A is a vehicle media-playbacksystem. In other embodiments, the external audio source can be a radio,a CD player, a juke box, or other audio producing devices. The audioidentification engine 108 then functions to record the audio content,analyze the audio content, and communicate with the media deliverysystem 104 to retrieve a media item corresponding to the audio content.The media item is presented on the user interface 112 and isautomatically played on the media-playback device 102 as media output110.

FIG. 2 is a schematic illustration of the example media-playback system100 of FIG. 1. In FIG. 2, the media-playback device 102, themedia-delivery system 104, and the network 106 are shown. Also shown arethe user U, the external audio source A, and satellites S.

As noted above, the media-playback device 102 plays media content items.In some embodiments, the media-playback device 102 plays media contentitems that are provided (e.g., streamed, transmitted, etc.) by a systemexternal to the media-playback device such as the media-delivery system104, another system, or a peer device. Alternatively, in someembodiments, the media-playback device 102 plays media content itemsstored locally on the media-playback device 102. Further, in at leastsome embodiments, the media-playback device 102 plays media contentitems that are stored locally as well as media content items provided byother systems.

In some embodiments, the media-playback device 102 is a computingdevice, handheld entertainment device, smartphone, tablet, watch,wearable device, or any other type of device capable of playing mediacontent. In yet other embodiments, the media-playback device 102 is anin-dash vehicle computer, laptop computer, desktop computer, television,gaming console, set-top box, network appliance, Blu-Ray™ or DVD player,media player, stereo, or radio.

In at least some embodiments, the media-playback device 102 includes alocation-determining device 150, a touch screen 152, a processing device154, a memory device 156, a content output device 158, amovement-detecting device 160, a network access device 162, asound-sensing device 164, and an optical-sensing device 166. Otherembodiments may include additional, different, or fewer components. Forexample, some embodiments do not include one or more of thelocation-determining device 150, the touch screen 152, themovement-detecting device 160, and the optical-sensing device 166.

The location-determining device 150 is a device that determines thelocation of the media-playback device 102. In some embodiments, thelocation-determining device 150 uses one or more of the followingtechnologies: Global Positioning System (GPS) technology which mayreceive GPS signals 174 from satellites S, cellular triangulationtechnology, network-based location identification technology, Wi-Fipositioning systems technology, and combinations thereof.

The touch screen 152 operates to receive an input 176 from a selector(e.g., a finger, stylus, etc.) controlled by the user U. In someembodiments, the touch screen 152 operates as both a display device anda user input device. In some embodiments, the touch screen 152 detectsinputs based on one or both of touches and near-touches. In someembodiments, the touch screen 152 displays a user interface 112 forinteracting with the media-playback device 102. As noted above, someembodiments do not include a touch screen 152. Some embodiments includea display device and one or more separate user interface devices.Further, some embodiments do not include a display device.

In some embodiments, the processing device 154 comprises one or morecentral processing units (CPU). In other embodiments, the processingdevice 154 additionally or alternatively includes one or more digitalsignal processors, field-programmable gate arrays, or other electroniccircuits. The processing device 154 executes instructions store in thememory device 156.

The memory device 156 operates to store data and instructions. In someembodiments, the memory device 156 encodes instructions for amedia-playback engine 170 that includes the audio identification engine108. In some embodiments, the media-playback engine 170 selects andplays back media content and generates interfaces for selecting andplaying back media content items. In some embodiments, themedia-playback engine 170 works with the audio identification engine 108to select and play media content items. As described above, the audioidentification engine 108 identifies audio content from external sourcesand initiates playback of media content items through the media-playbackengine that match and are related to the audio content.

In at least some embodiments, the audio identification engine 108records and analyzes audio content from external sources. The audioidentification engine 108 then accesses a matching media content itemfrom the media-delivery system 104 and immediately initiates playback ofthe media content item through the media-playback engine 170. Noadditional interaction from the user U is required to play the mediacontent item. Additionally, the audio identification engine 108 mayaccess additional media content from the media-delivery system 104 whichis then played by the media-playback engine 170.

Some embodiments of the memory device also include a media content cache172. The media content cache 172 stores media-content items, such asmedia content items that have been previously received from themedia-delivery system 104. The media content items stored in the mediacontent cache 172 may be stored in an encrypted or unencrypted format.The media content cache 172 can also store decryption keys for some orall of the media content items that are stored in an encrypted format.The media content cache 172 can also store metadata about media-contentitems such as title, artist name, album name, length, genre, mood, era,etc. The media content cache 172 can also store playback informationabout the media content items, such as the number of times the user hasrequested to playback the media content item or the current location ofplayback (e.g., when the media content item is an audiobook, podcast, orthe like for which a user may wish to resume playback).

The memory device 156 typically includes at least some form ofcomputer-readable media. Computer readable media includes any availablemedia that can be accessed by the media-playback device 102. By way ofexample, computer-readable media include computer readable storage mediaand computer readable communication media.

Computer readable storage media includes volatile and nonvolatile,removable and non-removable media implemented in any device configuredto store information such as computer readable instructions, datastructures, program modules, or other data. Computer readable storagemedia includes, but is not limited to, random access memory, read onlymemory, electrically erasable programmable read only memory, flashmemory and other memory technology, compact disc read only memory, blueray discs, digital versatile discs or other optical storage, magneticcassettes, magnetic tape, magnetic disk storage or other magneticstorage devices, or any other medium that can be used to store thedesired information and that can be accessed by the media-playbackdevice 102. In some embodiments, computer readable storage media isnon-transitory computer readable storage media.

Computer readable communication media typically embodies computerreadable instructions, data structures, program modules or other data ina modulated data signal such as a carrier wave or other transportmechanism and includes any information delivery media. The term“modulated data signal” refers to a signal that has one or more of itscharacteristics set or changed in such a manner as to encode informationin the signal. By way of example, computer readable communication mediaincludes wired media such as a wired network or direct-wired connection,and wireless media such as acoustic, radio frequency, infrared, andother wireless media. Combinations of any of the above are also includedwithin the scope of computer readable media.

The content output device 158 operates to output media content. In someembodiments, the content output device 158 generates media output 110for the user U. In some embodiments, the media output 110 is directedinto an interior cabin of the vehicle V. Examples of the content outputdevice 158 include a speaker assembly comprising one or more speakers,an audio output jack, a Bluetooth transmitter, a display panel, and avideo output jack. Other embodiments are possible as well. For example,the content output device 158 may transmit a signal through the audiooutput jack or Bluetooth transmitter that can be used to reproduce anaudio signal by a connected or paired device such as headphones, speakersystem, or vehicle head unit.

The movement-detecting device 160 senses movement of the media-playbackdevice 102. In some embodiments, the movement-detecting device 160 alsodetermines an orientation of the media-playback device 102. In at leastsome embodiments, the movement-detecting device 160 includes one or moreaccelerometers or other motion-detecting technologies ororientation-detecting technologies. As an example, themovement-detecting device 160 may determine an orientation of themedia-playback device 102 with respect to a primary direction ofgravitational acceleration. The movement-detecting device 160 may detectchanges in the determined orientation and interpret those changes asindicating movement of the media-playback device 102. Themovement-detecting device 160 may also detect other types ofacceleration of the media-playback device and interpret the accelerationas indicating movement of the media-playback device 102 too.

The network access device 162 operates to communicate with othercomputing devices over one or more networks, such as the network 106.Examples of the network access device 162 include one or more wirednetwork interfaces and wireless network interfaces. Examples of wirelessnetwork interfaces include infrared, BLUETOOTH® wireless technology,802.11a/b/g/n/ac, and cellular or other radio frequency interfaces.

The network 106 is an electronic communication network that facilitatescommunication between the media-playback device 102 and themedia-delivery system 104. An electronic communication network includesa set of computing devices and links between the computing devices. Thecomputing devices in the network use the links to enable communicationamong the computing devices in the network. The network 106 can includerouters, switches, mobile access points, bridges, hubs, intrusiondetection devices, storage devices, standalone server devices, bladeserver devices, sensors, desktop computers, firewall devices, laptopcomputers, handheld computers, mobile telephones, vehicular computingdevices, and other types of computing devices.

In various embodiments, the network 106 includes various types of links.For example, the network 106 can include wired and/or wireless links,including Bluetooth, ultra-wideband (UWB), 802.11, ZigBee, cellular, andother types of wireless links. Furthermore, in various embodiments, thenetwork 106 is implemented at various scales. For example, the network106 can be implemented as one or more vehicle are networks, local areanetworks (LANs), metropolitan area networks, subnets, wide area networks(such as the Internet), or can be implemented at another scale. Further,in some embodiments, the network 106 includes multiple networks, whichmay be of the same type or of multiple different types.

The sound-sensing device 164 senses sounds proximate the media-playbackdevice 102 (e.g., ambient sounds within the environment in which themedia-playback device 102 is located). In some embodiments, thesound-sensing device 164 comprises one or more microphones. For example,the sound-sensing device 164 may capture a recording of audio contentfrom an external audio source A proximate the media-playback device 102.These recordings may be analyzed by the audio identification engine 108to determine the identity of the audio content and retrieve matchingmedia content from the media-delivery system 104.

These recordings may also be analyzed by the media-playback device 102using speech-recognition technology to identify words spoken by theuser. The words may be recognized as commands from the user that alterthe behavior of the media-playback device 102 and the playback of mediacontent by the media-playback device 102. The words and/or recordingsmay also be analyzed by the media-playback device 102 using naturallanguage processing and/or intent-recognition technology to determineappropriate actions to take based on the spoken words. Additionally oralternatively, the sound-sensing device may determine various soundproperties about the sounds proximate the user such as volume, dominantfrequency or frequencies, etc. These sound properties may be used tomake inferences about the environment proximate to the media-playbackdevice 102 such as whether the sensed sounds are likely to correspond toa private vehicle, public transportation, etc. In some embodiments,recordings captured by the sound-sensing device 164 are transmitted tomedia-delivery system 104 (or another external server) for analysisusing speech-recognition and/or intent-recognition technologies.

The optical-sensing device 166 senses optical signals proximate themedia-playback device 102. In some embodiments, the optical-sensingdevice 166 comprises one or more light sensors or cameras. For example,the optical-sensing device 166 may capture images or videos. Thecaptured images can be processed (by the media-playback device 102 or anexternal server such as the media-delivery system 104 to which theimages are transmitted) to detect gestures, which may then beinterpreted as commands to change the playback of media content.Similarly, a light sensor can be used to determine various properties ofthe environment proximate the user computing device, such as thebrightness and primary frequency (or color or warmth) of the light inthe environment proximate the media-playback device 102. Theseproperties of the sensed light may then be used to infer whether themedia-playback device 102 is in an indoor environment, an outdoorenvironment, a private vehicle, public transit, etc.

The media-delivery system 104 comprises one or more computing devicesand provides media content items to the media-playback device 102 and,in some embodiments, other media-playback devices as well. Themedia-delivery system 104 includes a media server 180. Although FIG. 2shows a single media server 180, some embodiments include multiple mediaservers. In these embodiments, each of the multiple media servers may beidentical or similar and may provide similar functionality (e.g., toprovide greater capacity and redundancy, or to provide services frommultiple geographic locations). Alternatively, in these embodiments,some of the multiple media servers may perform specialized functions toprovide specialized services (e.g., services to enhance media contentplayback during travel or other activities, etc.). Various combinationsthereof are possible as well.

The media server 180 transmits stream media 218 to media-playbackdevices such as the media-playback device 102. In some embodiments, themedia server 180 includes a media server application 184, an audioanalysis server application 185, a travel server application 186, aprocessing device 188, a memory device 190, and a network access device192. The processing device 188, memory device 190, and network accessdevice 192 may be similar to the processing device 154, memory device156, and network access device 162 respectively, which have each beenpreviously described.

In some embodiments, the media server application 184 streams music orother audio, video, or other forms of media content. The media serverapplication 184 includes a media stream service 194, a media data store196, and a media application interface 198. The media stream service 194operates to buffer media content such as media content items 206, 208,and 210, for streaming to one or more streams 200, 202, and 204.

The media application interface 198 can receive requests or othercommunication from media-playback devices or other systems, to retrievemedia content items from the media server 180. For example, in FIG. 2,the media application interface 198 receives communication 234 from theaudio identification engine 108.

In some embodiments, the media data store 196 stores media content items212, media content metadata 214, and playlists 216. The media data store196 may comprise one or more databases and file systems. As noted above,the media content items 212 may be audio, video, or any other type ofmedia content, which may be stored in any format for storing mediacontent.

The media content metadata 214 operates to provide various informationassociated with the media content items 212. In some embodiments, themedia content metadata 214 includes one or more of title, artist name,album name, length, genre, mood, era, etc. In some embodiments, themedia content metadata 214 indicates whether a particular user likes aparticular media content item 212 or whether that media content item 212is similar to other media content items 212 that a particular user likesor dislikes. The playlists 216 operate to identify one or more of themedia content items 212 and assemble them into a programmed list. Insome embodiments, the playlists 216 identify a group of the mediacontent items 212 in a particular order. In other embodiments, theplaylists 216 merely identify a group of the media content items 212without specifying a particular order. Some, but not necessarily all, ofthe media content items 212 included in a particular one of theplaylists 216 are associated with a common characteristic such as acommon genre, mood, or era. The playlists 216 may include user-createdplaylists, which may be available to a particular user, a group ofusers, or to the public.

The audio analysis server application 185 operates to compare anacoustic fingerprint for audio content received from the audioidentification engine 108 with an acoustic fingerprint data store 221.In some embodiments, the audio identification engine 108 analyzes theaudio content to generate an acoustic fingerprint which is thencommunicated to the acoustic analysis engine 220. In other embodiments,the audio identification engine 108 communicates the recording of theaudio content to the acoustic analysis engine 220 which then generatesthe acoustic fingerprint. The acoustic fingerprint is then compared tothe acoustic fingerprint data store 221 to find a match.

Upon finding a matching media content item, the acoustic analysis engine220 communicates identifying information, such as media content metadata214, along with the media content item to the audio identificationengine 108. The identifying information can be displayed on a visualdisplay of the user interface 112 of the media-playback device 102and/or be announced verbally through the content output device 158.

Additionally, the acoustic analysis engine 220 communicates with themedia server application 184 to generate a playlist of related mediacontent items 212 and communicate the playlist to the media-playbackengine 170. In some embodiments, the playlist includes media contentitems 212 selected to match listening preferences of the user. Suchpreferences may be based on past listening behavior and playlists madeby the user. Alternatively, one of the existing playlists 216 related tothe media content item may be communicated to the media-playback engine170 for playback on the media-playback device 102. In some embodiments,the media item and playlist of additional media content are songs andthe playlists are related to the media item by one or more of genre,artist, album, level of popularity, and mood.

The travel server application 186 provides travel-specific functionalityfor providing media content items and interfaces for accessing mediacontent items to media-playback devices. In some embodiments, the travelserver application 186 includes a travel application interface 222 and atravel data store 224.

The travel application interface 222 can receive requests or othercommunication from media-playback devices or other systems, to retrievetravel information and media content items for playback during travel.For example, in FIG. 2, the travel application interface 222 receivescommunication 236 from the media-playback engine 170.

Additionally, the travel server application 186 can process data anduser input information received from the media-playback device 102. Insome embodiments, travel server application 186 operates to transmitinformation about the suitability of one or more media content items forplayback during travel. In some embodiments, the travel serverapplication 186 may provide a list of media content items that aresuited to particular geographic locations, travel conditions, modes oftravel, user preferences, etc.

For example, the travel server application 186 may store metadata andother information that associates media content items with geographiclocations, forms of travel, travel conditions, etc. in the travel datastore 224. The travel server application 186 may also store informationthat associates media content items with an individual or group ofusers' preferences for consuming that media content during travel in thetravel data store 224. The travel data store 224 may comprise one ormore files or databases. The travel data store 224 can also comprisefiles, tables, or fields in the media data store 196.

In some embodiments, the travel data store 224 stores travel mediametadata. The travel media metadata may include various types ofinformation associated with media content items, such as the mediacontent items 212 in the media data store 196. In some embodiments, thetravel data store 224 provides information that may be useful forselecting media content items for playback during travel. For example,in some embodiments, the travel data store 224 stores travel scores formedia content items that correspond to the suitability of particularmedia content items for playback during travel. As another example, insome embodiments, the travel data store 224 stores timestamps (e.g.,start and end points) that identify portions of media content items thatare particularly well-suited for playback during travel (or otheractivities).

Each of the media-playback device 102 and the media-delivery system 104can include additional physical computer or hardware resources. In atleast some embodiments, the media-playback device 102 communicates withthe media-delivery system 104 via the network 106.

Although in FIG. 2 only a single media-playback device 102 andmedia-delivery system 104 are shown, in accordance with someembodiments, the media-delivery system 104 can support the simultaneoususe of multiple media-playback devices, and the media-playback devicecan simultaneously access media content from multiple media-deliverysystems. Additionally, although FIG. 2 illustrates a streaming mediabased system for media playback during travel, other embodiments arepossible as well. For example, in some embodiments, the media-playbackdevice 102 includes a media data store 196 and the media-playback device102 is configured to select and playback media content items withoutaccessing the media-delivery system 104. Further, in some embodiments,the media-playback device 102 operates to store previously streamedmedia content items in a local media data store (e.g., the media contentcache 172).

In at least some embodiments, the media-delivery system 104 can be usedto stream, progressively download, or otherwise communicate music, otheraudio, video, or other forms of media content items to themedia-playback device 102 for playback during travel on themedia-playback device 102. In accordance with an embodiment, a user Ucan direct the input 176 to the user interface 112 to issue requests,for example, to playback media content for playback during travel on themedia-playback device 102.

FIG. 3A is a schematic diagram of an embodiment of a media-playbacksystem 260 shown within an interior cabin of the vehicle V. Themedia-playback system 260 is an example of the media-playback system100.

In this example, the media-playback system 260 includes a media-playbackdevice 262. The media-playback device 262 is an embodiment of themedia-playback device 102. Although not shown in this figure, someembodiments of the media-playback system 260 also include amedia-delivery system such as the media-delivery system 104.

Also shown in this figure, is a vehicle media-playback system 264. Thevehicle media-playback system 264 can provide an external audio sourceA. An example of the vehicle media-playback system 264 is illustratedand described with respect to FIG. 4.

As shown in FIG. 3A, the vehicle media-playback system 264 includes avehicle head unit 266 and a speaker assembly 268. The vehicle head unit266 receives or generates media content signals from one or moresources. As shown in FIG. 2, the vehicle head unit 266 includes adisplay device 270, which can be used to display information about theselected or available sources of media content. In some embodiments, thedisplay device 270 can also display media content. Typically, thevehicle head unit 266 also includes a user input assembly 360 that isusable to control the vehicle media-playback system 264 and to select asource of media content. The vehicle head unit 266 then transmits thosesignals to output devices such as the speaker assembly 268 (sometimesvia an amplifier). In some embodiments, the vehicle head unit 266 alsogenerates image or video content signals that can be transmitted to thedisplay device 270 or another device for display.

In FIG. 3A, the vehicle media playback system 264 is operating as anexternal audio source A while the FM radio is operating. As isillustrated in more detail in FIG. 9, the media-playback device 262 canbe prompted to record external audio content in order to identify theaudio content. Once the audio content is identified, media output 110corresponding to the audio content is played from the media-playbackdevice 262 as shown in FIG. 3B.

FIG. 3B shows media-playback system 260 after audio content has beenidentified and is being played back by the media-playback device 262.The vehicle media-playback system 264 has been turned off and mediaoutput 110 is coming from the media-playback device 262 instead.

In an alternative embodiment of the media-playback system 260 shown inFIG. 3C, the media-playback device 262 is operating in conjunction withan external speaker assembly 282. The external speaker assembly 282generates media output 110 based on a signal received from themedia-playback device 262 to play the identified media content. Theexternal speaker assembly 282 can include one or more speakers. Theexternal speaker assembly 282 can also include a mechanical apparatusfor attachment to the vehicle. Although alternatives are possible, inFIG. 3C, the external speaker assembly 282 is attached to a sun visor ofthe vehicle.

In some embodiments, the media-playback device 262 connects to theexternal speaker assembly 282 using BLUETOOTH®. The media-playbackdevice 262 then transmits an audio signal to the external speakerassembly 282 via Bluetooth, which is then used by the external speakerassembly 282 to generate the media output 110. In some embodiments, themedia-playback device 262 and the external speaker assembly 282communicate using a different wireless protocol. Further, in someembodiments, the media-playback device 262 can transmit a media contentsignal to the external speaker assembly 282 via a cable (e.g., an analogor digital audio cable, a universal serial bus (USB) cable).

In FIG. 3C, the vehicle media-playback system 264 is not being used togenerate media output. Instead, the media-playback device 262 istransmitting a signal to the external speaker assembly 282, whichgenerates the media output 110 without using the vehicle media-playbacksystem 264.

FIG. 3D is a schematic diagram of an embodiment of a media-playbacksystem 300. The media-playback system 300 is an example of themedia-playback system 100. The media-playback system 300 includes themedia-playback device 262, the vehicle media-playback system 264, and acable 302.

The cable 302 connects the media-playback device 262 to the vehiclemedia-playback system 264. Although alternatives are possible, the cable302 can be an analog audio cable that is connects an audio output deviceof the media-playback device 262 (e.g., a headphone jack) to a wiredinput device 354 of the vehicle media-playback system 264 (e.g., anauxiliary input jack). In alternative embodiments, the cable 302 is aUSB cable that connects the media-playback device 262 to the vehiclemedia-playback system 264.

As shown in FIG. 3D, the display device 270 indicates that the vehiclemedia-playback system 264 is generating external audio content Areceived from an FM radio transmission. The external audio A is playedwithin the interior cabin of the vehicle by the vehicle media-playbacksystem 264 using the speaker assembly 268.

In FIG. 3E, the vehicle media-playback system 264 input has switched toauxiliary input from the media-playback device 262. In the processdescribed with respect to FIGS. 6-11, the media-playback device 262 hasrecorded and identified the audio content produced by the vehiclemedia-playback system 264 as external audio A. Now the media-playbackdevice 262 has been selected as the input for the vehicle media-playbacksystem 264. The selection of auxiliary input may be done manually by auser by using a user interface or one or more controls on the vehiclemedia-playback system 264 or by controls on the steering wheel.Alternatively, the media-playback device 262 can send a command to thevehicle media-playback system 264 to change the input from the FM radioto auxiliary input. Once the input has been changed, the media-playbackdevice 262 plays media output 110 through the speaker assembly 268.

FIG. 3F is a schematic diagram of an alternative embodiment of amedia-playback system 300. In this example, the media-playback device262 and the vehicle media-playback system 264 are connected using awireless communication protocol. Although alternatives are possible, themedia-playback device 262 and the vehicle media-playback system 264 canbe connected using Bluetooth. As an alternative, the media-playbackdevice 262 and the vehicle media-playback system 264 can be connectedvia a Wi-Fi network (e.g., using an 802.11 wireless communicationprotocols) or another type of wireless connection.

As shown in FIG. 3F, the display device 270 indicates that the vehiclemedia-playback system 264 has been switched to generate media output 110based on a signal received from a wireless input instead of the FMradio. The media output 110 is played within the interior cabin of thevehicle by the vehicle media-playback system 264 using the speakerassembly 268.

FIG. 4 is a schematic diagram of an embodiment of the vehiclemedia-playback system 264. In this example, the vehicle media-playbacksystem 264 includes a vehicle head unit 266, an amplifier 364, and aspeaker assembly 268.

The vehicle head unit 266 received user input and generates mediacontent from various sources. In this example, the vehicle head unit 266includes a receiver 350, a wireless communication device 352, a wiredinput device 354, a processing device 356, a memory device 358, a userinput assembly 360, a display device 270, and a stored media interfaceassembly 362. The processing device 356 may be similar to the processingdevice 154 and the memory device 358 may be similar to the memory device156.

The receiver 350 receives media content signals from various externalsources. The received signals can then be used to generate media outputby the vehicle media-playback system 264. The receiver 350 may compriseone or more tuners for receiving radio signals such as FM or AM radiosignals. Additionally, the receiver 350 may comprise a receiver forreceiving satellite radio signals.

The wireless communication device 352 communicates with other devicesusing wireless data signals. The wireless communication device 352 caninclude one or more of a Bluetooth transceiver and a Wi-Fi transceiver.The wireless data signal may comprise a media content signal such as anaudio or video signal.

The wired input device 354 receives a cable for providing media contentand/or commands. In some embodiments, the wired input device 354 caninclude an auxiliary input jack for receiving a plug that transmitsanalog audio signals. The wired input device 354 can also includedifferent or multiple input jacks for receiving plugs that transmitother types of analog or digital signals (e.g., USB, HDMI, CompositeVideo, YPbPr, DVI). In some embodiments, the wired input device 354 isalso used to receive instructions from other devices. For example, thewired input device 354 can receive instructions from the media-playbackdevice 102 to change the input of the vehicle head unit 266.

The user input assembly 360 comprises one or more input devices forproviding user input to the vehicle media-playback system 264. In someembodiments, the user input assembly 360 includes multiple knobs,buttons, and other types of input controls for adjusting volume,selecting sources and content, and adjusting various output parameters.In some embodiments, the various input devices are disposed on or near afront surface of the vehicle head unit 266. The various input devicescan also be disposed on the steering wheel of the vehicle or elsewhere.Additionally or alternatively, the user input assembly 360 can includeone or more touch sensitive surfaces, which can be incorporated in thedisplay device 270.

The display device 270 displays information. In some embodiments, thedisplay device 270 includes a liquid crystal display (LCD) panel fordisplaying textual information about content and/or settings of thevehicle media-playback system 264. The display device 270 can alsoinclude other types of display panels such as a light emitting diode(LED) panel. In some embodiments, the display device 270 can alsodisplay image or video content.

The stored media interface assembly 362 reads media content stored on aphysical medium. In some embodiments, the stored media interfaceassembly 362 comprises one or more devices for reading media contentfrom a physical medium such as a compact disc or cassette tape.

The amplifier 364 amplifies a signal received from the vehicle head unit266 and transmits the amplified signal to the speaker assembly 268. Inthis manner, the media output 110 can be played back at a greatervolume. The amplifier 364 may include a power source to power theamplification.

The speaker assembly 268 produces an audio output based on an electronicsignal. The speaker assembly 268 may comprise one or more speakersdisposed at various locations within the vehicle. In some embodiments,separate signals are received for at least some of the speakers (e.g.,to provide stereo or surround sound). The speaker assembly 268 may be anexternal audio source A as well as a source of media output 110.

FIG. 5A is a schematic diagram of an embodiment of a vehiclemedia-playback system 370 that incorporates a media-playback device. Thevehicle media-playback system 370 is an example of the media-playbacksystem 100. The vehicle media-playback system 370 includes a vehiclehead unit 372 and the speaker assembly 268. In this example, the vehiclehead unit 372 incorporates the components and functionality of both amedia-playback device such as the media-playback device 102 and avehicle head unit such as the vehicle head unit 266. The display device270 is showing a user interface screen for selecting and playing backmedia content. In FIG. 5A, the display device 270 shows that the vehiclemedia-playback system 370 is currently playing FM radio. External audiocontent A is played within the interior cabin of the vehicle by thevehicle media-playback system 370 using the speaker assembly 268.Although alternatives are possible, the vehicle media-playback system370 often also includes navigation, climate control, and vehiclemaintenance capabilities too.

In some embodiments, the vehicle media-playback system 370 includes amicrophone or other means of recording audio content. Recorded externalaudio content A is analyzed and identified. Upon identification, thevehicle media-playback system 370 automatically switches the output ofthe vehicle head unit 372 from FM radio (or whichever output wasselected at the time) to streaming audio. The vehicle head unit 372receives media content that corresponds to the recorded external audio Aand immediately plays the media content through the speaker assembly 268as media output 110.

FIG. 6 is a schematic block diagram of the audio identification engine108 depicted in FIG. 2. In some embodiments, the audio identificationengine 108 includes a sampling engine 602, an analysis engine 604, anidentification output engine 606, and a playlist initiation engine 608.

The sampling engine 602 operates to record a portion of the audiocontent from an external audio source A. The recording may be occurringcontinuously in the background or may occur only when initiated by auser. The recording of a sample of audio content may be 10 seconds long,20 seconds long, 30 seconds long, 60 seconds long, or 90 seconds long,for example.

The analysis engine 604 operates to analyze the recorded sample of audiocontent. The audio content is analyzed to determine an acousticfingerprint, which is compared to other fingerprints in one or moredatabases.

The identification output engine 606 operates to retrieve identifyinginformation regarding the audio content and present that information tothe user. The identifying information may include a song title, anartist name, and an album title. The identifying information can bestored as metadata. The identifying information can be displayed and/orannounced to the user.

The playlist initiation engine 608 operates to initiate playback of theidentified media content and a related playlist of media content.Playing the media item can occur by playing the media item from a startof the media item, seamlessly playing the media item where the externalaudio source left off, or playing a distinctive portion of the mediaitem. For example, a car radio could be halfway through a song when theuser requested to identify and playback the song on his or her mediaplayback device. The media playback device could pick up playing thesong at the same point halfway through. Alternatively, the mediaplayback device could begin playing the song at the beginning. Playbackbegins immediately without additional user input. The user does not needto select an option to play the media content or buy the media contentto initiate playback. The user does not need to access or log in to anyother media applications to initiate playback of the media content. Theplaylist is built based on past listening preferences and behaviors ofthe user.

FIG. 7 illustrates an example method 700 of identifying and playingmedia content performed by some embodiments of the audio identificationengine 108. Such a method can be used, for example, when the user istraveling in a vehicle and listening to audio content from a vehiclemedia-playback system, such as the vehicle media-playback system 264 ofFIG. 4. In other embodiments, the user may be listening to audio contentfrom other external audio sources such as a juke box at a bar, a soundsystem at a shopping center, satellite radio at a restaurant, or atelevision set in the user's home.

At operation 702, audio content is recorded. The audio content isexternal to a media-playback device 102, such as the device depicted inFIG. 2. The audio content may be any content that a user wishes toidentify. In some embodiments the audio content is music. In someembodiments the audio content comes from a vehicle speaker assembly 268.

At operation 704, the audio content is analyzed and identified. Amedia-playback device 102 may communicate with a media-delivery system104 to analyze the audio content. The audio content is compared to mediacontent stored in databases and a matching media content item isidentified. At operation 706, the identified media item is played. Insome embodiments, the media item is streamed on the media-playbackdevice 102 using the media server application 184. The media item isautomatically played on the media-playback device 102 without additionalinteraction from the user. In some embodiments, the media-playbackdevice 102 may be linked with a vehicle media-playback system 264 thatplays the media item through a speaker assembly 268.

FIG. 8A illustrates an example method 800 of recording audio contentthat is performed by some embodiments of the audio identification engine108. At operation 802, a command is received from a user to identifyaudio content in the user's environment (e.g., vehicle). At operation804, the audio content is recorded from an external audio source (e.g.vehicle speaker assembly 268). In some embodiments employed in avehicle, switching the vehicle media-playback system 264 to access themedia-playback device 102 will automatically initiate recording of audiocontent.

FIG. 8B illustrates an alternative method 850 of recording audio contentthat is performed by some embodiments of the audio identification engine108. At operation 852, ambient sound from an external audio source isrecorded. This recording may be happening as a background operation onthe media-playback device 102. In some embodiments, the media-playbackdevice 102 records audio continuously but only saves the last 10, 20,30, 60, or 90 seconds of the audio recording. In such instances, theaudio recording is only saved upon receiving a command from the user toidentify the audio content and all other recorded audio is discarded. Insuch embodiments, the media-playback engine 170 pauses any audioplayback while recording is taking place.

At operation 854, a command is received from a user to identify audiocontent is the user's environment. In some embodiments employed in avehicle where the media-playback device 102 is integrated into thevehicle, switching the vehicle media-playback system 264 to access themedia-playback device 102 will automatically initiate identification ofpre-recorded audio content. In other embodiments, a touch or voicecommand initiates identification of the pre-recorded audio content. Atoperation 856, the most recently recorded sample of audio is saved foranalysis.

FIG. 9 shows an embodiment of a media-playback device 102 operatingwithin an interior cabin of the vehicle V. The display 270 of thevehicle head unit 266 indicates that the vehicle media-playback system264 is playing audio from FM radio. The user interface 112 of themedia-playback device 102 shows a screen reading “tap or speak toidentify song.” A user may interact with a touch screen or give verbalinstruction to command the media-playback device 102 to identifyexternal audio content A being played by the vehicle head unit 266. Theexemplary method described in FIGS. 8A-B is initiated upon interactionof the user with this user interface 112.

FIG. 10 illustrates an example method 1000 of analyzing and identifyingaudio content that is performed by some embodiments of the audioidentification engine 108. In some embodiments the method 1000 isperformed in conjunction with the audio analysis server application 185.

At operation 1002 the recorded external audio content is analyzed todetermine an acoustic fingerprint. The audio content is condensed downto a smaller, coded data file. There are multiple techniques which couldbe used to generate an acoustic fingerprint, such as identifyingspecific points in a spectrogram.

In some embodiments, the audio content is analyzed to determine anacoustic fingerprint which is compared to a database of acousticfingerprints to identify a matching media content item. Acousticfingerprints are condensed digital summaries that represent a wholemedia file. Acoustic fingerprints can be based upon spectrograms ofaudio content. A spectrogram is a time-frequency graph that typicallycharts frequency of the audio over time. The spectrogram can alsoindicate the amplitude of the frequency at a particular time by showinggreater intensity at certain points. These points of greater intensityare selected to be the acoustic fingerprint.

At operation 1004, the acoustic fingerprint of the recorded audiocontent is compared to other acoustic fingerprints. In some embodiments,the acoustic analysis engine 220 within the audio analysis serverapplication 185 compares the acoustic fingerprint with those stored inthe acoustic fingerprint data store 221.

At operation 1006, a media item corresponding to the recorded audiocontent is identified. The media item and its accompanying metadata areretrieved from the media data store 196.

At operation 1008, the identifying information for the media item isdisplayed to the user. The identifying information may be metadatastored in the media content metadata 214 storage of the media data store196. In embodiments where the audio content is music, the identifyinginformation can be one or more of a song title, an artist, an album, anda genre. The identifying information can be displayed on a touch screen152 of the media-playback device or may be announced to the user throughspoken words emitted from the content output device 158.

FIG. 11 illustrates an example method 1100 of playing identified mediacontent that is performed by some embodiments of the audioidentification engine 108 in a vehicle. The media item corresponding tothe recorded audio content is automatically played through the contentoutput device 158 of the media-playback device 102. In some embodiments,the media-playback device 102 is connected to a vehicle media-playbacksystem 264 which plays back the media through a speaker assembly 268.The identified media content is immediately played without anyadditional interaction required from the user.

At operation 1102, the input of the vehicle head unit 266 is switched toreceive signals from the media-playback device 102. The media-playbackdevice 102 may communicate with the vehicle head unit 266 through awired connection at the wired input device 354 or through a wirelessconnection at the wireless communication device 352. For example, thevehicle head unit 266 may switch from receiving an FM signal toreceiving a Bluetooth signal in order to playback the media item fromthe media-playback device 102. In another example, the external audiomay come from a CD player or satellite radio. In some embodiments, theswitch is made automatically without any action required from the user.In other embodiments, the user may need to input a command to thevehicle head unit 266 through the user input assembly 360 to change theinput. Such commands may be received directly through controls on thevehicle head unit 266 or indirectly through controls in the vehicle suchas steering wheel controls. In some embodiments employed in a vehiclewhere the media-playback device 102 is integrated into the vehicle,switching the vehicle media-playback system 264 to access themedia-playback device 102 will automatically initiates playback of themedia content at the same point in the media item where the externalaudio source was playing.

At operation 1104, the media item is played through the media-playbackdevice 102. In some embodiments, the media item is transmitted throughthe speaker assembly 268 of the vehicle head unit 266. In someembodiments, playback of the media content resumes at the point in theaudio where the vehicle head unit 266 left off, resulting in a seamlesstransition. In another embodiment, playback begins at the start of themedia item, for example at the beginning of a song. In yet anotherembodiment, playback of the media content begins at a distinctiveportion of the media item, such as the hook of a song.

At operation 1106 additional media content is generated to produce aplaylist of related media items. Media content items 212 and playlists216 are accessed by the media server application 184 to assemble aplaylist of related media content. Such playlist may include mediacontent that the user has indicated a preference for in the past. Thesepreferences may be stored in the memory device 156 of the media-playbackdevice 102 or may be accessed from the memory device 190 of themedia-delivery system. 104. Machine learning may be utilized todetermine a user's listening preferences.

In some embodiments, the media content items 212 and playlists 216selected to be played include content that the user has previouslyviewed or listened to or otherwise indicated as being desirable. Forexample, the media content items 212 can be selected from the user'spersonal media list. The selected media content items 212 can alsoinclude media content from the same artists and genres as content theuser has previously shown a preference for. Additionally, new mediacontent can be included based on criteria such as a relationship toselected media content attributed using artificial learning algorithmsor other machine learning techniques.

At operation 1108, the playlist is dynamically updated to includeadditional media content as the playlist is playing. Additional relatedmedia content can be continuously added to the playlist as long as theuser wishes to listen. In some embodiments, the playlist may be a “radiostation” type playlist that continually updates with new media (songs)that is related to the remaining media. In one embodiment, the mediaitem is a song and the additional media content is song having a relatedgenre, artist, or album as the media item. In this method, the media isrelated to the original song that was identified in operation 1002. Theplaylist may update based on feedback from the user such as “liking” or“disliking” a piece of media content.

FIG. 12 shows an embodiment of the media-playback device 102 displayinga user interface 112 which indicates that a song has been identified.This user interface may correspond to a method such as that of FIG. 10being implemented on the media-playback device 102. The song informationis also displayed on the user interface 112. In some embodiments, thesong information may also be announced to the user verbally. The song isbeing played through a Bluetooth connection to the vehicle head unit266, as indicated by the display 270. Media output 110 is coming fromspeakers (not shown) in the vehicle media-playback system 264 to playthe song to the user traveling in the vehicle.

FIG. 13 shows an embodiment of the media-playback device 102 displayinga user interface 112 which indicates that a customized playlist based onthe identified song has begun playing. This user interface maycorrespond to a method such as that of FIG. 11 being implemented on themedia-playback device 102. In this example, the song “Radioactive” byImagine Dragons was identified and played in FIG. 12. Here, a playlistor “radio station” based on “Radioactive” has been generated. The song“Royals” by Lorde was chosen to be played after “Radioactive” finishedplaying.

As noted previously, although many of the examples provided above aredescribed with respect to traveling in a vehicle, other embodimentsrelate to other forms of travel or even other types of activities inwhich a user wishes to identify audio content. The various embodimentsdescribed above are provided by way of illustration only and should notbe construed to limit the claims attached hereto. Those skilled in theart will readily recognize various modifications and changes that may bemade without following the example embodiments and applicationsillustrated and described herein, and without departing from the truespirit and scope of the following claims.

1-22. (canceled)
 23. A media playback device comprising: a sound-sensingdevice configured to record audio content; a content output device; amemory device encoding instructions that, when executed by a processor,cause the media playback device to: receive a command from a user toidentify audio content in an environment of the user; record a portionof the audio content; communicate the recorded portion of audio contentto a media delivery system; receive identifying information for theaudio content from the media delivery system; receive a media itemcorresponding to the audio content from the media delivery system; andwithout receiving an additional command from the user, initiate playbackof the media item through the content output device.
 24. The mediaplayback device of claim 23, wherein the command is a voice commandreceived by the sound-sensing device.
 25. The media playback device ofclaim 23, wherein the memory device further encodes instructions toreceive a playlist of additional media items from the media deliverysystem, the additional media items being related to the media itemcorresponding to the audio content.
 26. The media playback device ofclaim 23, wherein the memory device further encodes instructions toreceive a playlist of additional media items from the media deliverysystem, the additional media items being related to the media itemcorresponding to the audio content.
 27. The media playback device ofclaim 26, wherein the media item and additional media items are songssharing the same genre, artist, or album.
 28. The media playback deviceof claim 23, wherein the memory device further encodes instructions toplay a verbal announcement before playback of the media item isinitiated, the verbal announcement providing the identifying informationfor the audio content.
 29. The media playback device of claim 23,wherein the memory device further encodes instructions to present theidentifying information for the audio content on a visual display of themedia playback device.
 30. A system for identifying audio content, thesystem comprising: an audio analysis server application configured to:receive recorded audio content from a media playback device; analyze theaudio content; identify an acoustic fingerprint of the audio content;compare the acoustic fingerprint to an acoustic fingerprint data store;and identify a media item having the same acoustic fingerprint as theaudio content; and a media stream service configured to access the mediaitem from a media data store and stream the media item to the mediaplayback device, without receiving an additional request from the mediaplayback device.
 31. The system of claim 30, wherein the audio analysisserver application is further configured to communicate identifyinginformation for the media item to the media playback device.
 32. Thesystem of claim 30, wherein the media stream service is furtherconfigured to access a playlist of additional media items and stream theplaylist to the media playback device, the playlist of additional mediaitems being related to the media item by one or more of genre, artist,album, level of popularity, and mood.
 33. The system of claim 32,wherein the playlist is built based on past listening preferences andbehaviors a user operating the media playback device.
 34. The system ofclaim 32, wherein the playlist is dynamically updated to includeadditional media content as the playlist is playing.
 35. The system ofclaim 32, wherein the playlist is selected using a machine learningalgorithm and is dynamically updated in response to user feedback. 36.At least one non-transitory computer-readable medium havingcomputer-executable instructions embodied thereon that, when executed byat least one computing device, cause the at least one computing deviceto: receive, at a sound-sensing device, a verbal command to identify asong; record, using the sound-sensing device, audio content in theenvironment of the at least one computing device; analyze the audiocontent to identify an acoustic fingerprint; compare the acousticfingerprint of the audio content with a database of acousticfingerprints; identify a song matching the audio content acousticfingerprint; and initiate playback of the song through a content outputdevice of the at least one computing device, without requiringadditional input.
 37. The at least one non-transitory computer-readablemedia of claim 36, wherein playback of the song begins at the start ofthe song.
 38. The at least one non-transitory computer-readable media ofclaim 36, wherein playback of the song picks up where the audio contentleft off.
 39. The at least one non-transitory computer-readable media ofclaim 36, wherein playback of the song begins at a distinctive portionof the song.
 40. The at least one non-transitory computer-readable mediaof claim 36, wherein the instructions further cause the computingdevices to generate a playlist of songs that are related to theidentified song and play the playlist after the identified song hasconcluded playback.
 41. The at least one non-transitorycomputer-readable media of claim 40, wherein the playlist is generatedbased on user preferences that are determined using machine learning.42. The at least one non-transitory computer-readable media of claim 40,wherein the playlist is generated using songs that a user of thecomputing devices has indicated a preference for in the past.